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Abstract —In this paper, we consider the joint design of data 
compression and 802.15.4-based medium access control (MAC) 
protocol for smartgrids with renewable energy. We study the set¬ 
ting where a number of nodes, each of which comprises electricity 
load and/or renewable sources, report periodically their injected 
powers to a data concentrator. Our design exploits the correlation 
of the reported data in both time and space to perform efficient 
data compression using the compressed sensing (CS) technique 
and efficiently engineer the MAC protocol so that the reported 
data can be recovered reliably within minimum reporting time. 
Specifically, we perform the following design tasks: i) we employ 
the two-dimensional (2D) CS technique to compress the reported 
data in the distributed manner; ii) we propose to adapt the 
802.15.4 MAC protocol frame structure to enable efficient data 
transmission and reliable data reconstruction; and iii) we develop 
an analytical model based on which we can obtain the optimal 
parameter configuration to minimize the reporting delay. Finally, 
numerical results are presented to demonstrate the effectiveness 
of our design. 

Index Terms —CSMA MAC protocols, renewable energy, com¬ 
pressed sensing, smartgrids, and power line communications. 

1. Introduction 

Future energy grid is expected to integrate more distributed 
and renewable energy resources with significantly enhanced 
communications infrastructure for timely and reliable data 
exchanges between the control center and various grid con¬ 
trol points m. The advanced communications infrastructure 
supports many critical grid control, monitoring, and manage¬ 
ment operations and emerging smartgrid applications. The 
communications infrastructure is typically hierarchical, e.g., 
data communications between customers and local concen¬ 
trators via field/neighborhood area networks, and between 
local concentrators and the utility company via long-haul wide 
area networks na, 0. The former is usually based on the 
low bandwidth communications technologies such as Zigbee, 
WiFi, and power line communications (PLC) while the later 
is required to have higher capacity, which can be realized by 
employing LTE, 3G cellular, WiMAX, and fiber optics for 
example . 

Our work concerns the design of data compression and 
MAC protocol for the field/neighborhood area network where 
PLC is employed to report injected powers from grid connec¬ 
tion points to the local concentrator. In fact, several smartgrid 
projects in France (H, and Spain lO have chosen PLC for 
smartgrid deployment since PLC can be realized with low- 
cost modems and it utilizes available electricity wires for data 
communications. We focus on the reporting of injected powers 
at different grid connection points since this information can 
be employed for many grid applications such as line-failure 
prediction Q or congestion management and various grid 
control applications (a. Furthermore, the utility control center 
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can utilize collected data to further estimate the complete 
phasor data at different nodes which can be then used in the 
control of voltages and reactive powers or in the active load 
management a, 0, and outage management El. 

There have been some existing works that study data com¬ 
pression and MAC protocol design issues in the smartgrid and 
wireless network contexts. In addition, there are two popular 
standards for the PLC technology, namely PRIME and G3- 
PLC El-CIl. In a, the authors study the state estimation 
problem where the voltage phasors at different nodes are 
recovered based on limited reported data. The authors in 
lEl consider random access exploiting the CS capability for 
energy efficiency communications in wireless sensor networks. 
However, joint design of communications access and data 
compression for smartgrid is still very under-explored by the 
existing literature. 

In this paper, we propose to engineer the PRIME MAC 
protocol jointly with CS-based data compression for smartgrid 
communications networks. Specifically, we consider a dis¬ 
tributed random reporting (DRR) mechanism where each grid 
connection point (referred to as a node in the sequel) reports 
its injected power data in a probabilistic manner and using the 
PRIME MAC protocol for data transmission. The Kronecker 
CS technique (2D CS), which can exploit the spatio-temporal 
correlation of data, is then employed for data reconstruction at 
the utility control center. We develop an analytical model for 
the proposed design based on which we can guarantee reliable 
data construction. In addition, we present an algorithm which 
determines efficient configuration for MAC parameters so that 
the average reporting time is minimized. 

II. System Model 

We consider the one-hop communications between a group 
of grid nodes and one data concentrator (point to multi- 
points). We assume that each node is equipped with a smart 
meter (SM) that reports injected power data to the data 
concentrator using plcQ. Moreover, each node is assumed 
to comprise a load and/or a solar/wind generator. Large- 
scale deployment of such renewable generators at customer 
premises, the light poles, or advertising panels has been 
realized for urban areas in Europe O, Q. In reality, the data 
concentrator is installed at the same location as the transformer 
to collect data from grid nodes of the distribution network. 
This communication and grid model is shown in Fig. [T] 

We assume that each node must report the injected power 
value once in each reporting interval (RI) (H, O. For most 
available smart meters, the RI can be configured from a few 
minutes to hours lEl. Let Si^i, Sg^i and be the load, 

^We implicitly assume that the wide area network has very high bandwidth 
and can deliver the received data at the data concentrators to the utility control 
center without errors 
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Fig. 1. Smartgrids with advanced communications infrastructure 


distributed generation and injected powers at node i, respec¬ 
tively. Then, the injected power at node i can be expressed as 
Si = S^^i — S/^i. We assume that the control center wishes 
to obtain information about Si for all nodes G [1,^^]) in 
each RI0 Under the constraint of low bandwidth properties 
of PLC our objectives are to develop joint design 

of the data compression and MAC protocol so that reliable 
data reporting can be achieved within minimum reporting time 
(RT). One important design requirement target is that the MAC 
protocol is distributed which allows low-cost and large-scale 
deployment. 

It has been shown in some recent works that power data 
related to power grids with renewable energy exhibit strong 
correlation (implying sparsity) over space and time M, ca. 
Therefore, to realize the underlying compression benefits, 
the control center only needs to collect the reported data in 
tut < riT RIs (i.e., compression over time) from a subset of 
ns nodes with size ms < ns (i.e., compression over space) 
to reconstruct the complete data for ns nodes and nr RIs. 
Note that the reconstructor can be at the data concentrator 
or at the control center. The later is chosen in this work as 
the best choice for reducing bandwidth usage in the second 
phase. We can easily extend to apply our proposed method 
to the applications where each node would report their load 
and distributed generation powers instead of injected powers. 
Because these load and distributed generation powers also have 
strong correlation as presented above. 

For practical implementation, the data transmission and 
construction can be performed in a rolling manner where data 
construction performed at a particular RI utilizes the reported 
data over the latest nr RIs. To guarantee the desirable data 
construction quality, the control center must receive sufficient 
data which is impacted by the employed reporting mechanism 
and MAC protocol. Specifically, we must determine the values 
of mT and ms for some given values nT and to achieve 
the desirable data construction reliability. 

^The proposed framework can be applied to other types of grid data as 
long as they exhibit sparsity in the time and/or space domains. 


III. CS-Based Data Compression 

A. CS-Based Data Processing 

Without loss of generality, we consider data construction 
for one data field for nr RIs and ns nodes. Let Z be an 
ns X nT matrix whose (i,jf)-th element denotes the injected 
power at node i and RI j. We will refer to the data from one 
RI (i.e., one column of Z) as one data block in this paper. 
From the CS theory, we can compress the data if they possess 
sparsity properties in a certain domain such as wavelet domain. 
Specifically, we can express the data matrix Z as 

Z = (1) 

where G and ^ denote wavelet 

bases in space and time dimensions, respectively C3. The 
sparsity of Z can be observed in the wavelet domain if matrix 
A has certain K significant (nonzero) coefficients where K < 

ns X nT- 

We now proceed to describe the data compression and 
reconstruction operations. Let us denote G (for 

space) and ^ time) as the two sparse 

observation matrices where entries in these two matrices are 
i.i.d uniform random numbers where ms < ns and mr < 
Specifically, we can employ ^s and to sample the power 
data from which we obtain the following observation matrix 

Y = . (2) 

Let = nrns and M = mrms be the number of 
elements of Z and Y, respectively. From the CS theory, we can 
reliably reconstruct the data matrix Z by using the observation 
matrix Y if mr and ms are appropriately chosen. For the 
smartgrid communications design, this implies that the control 
center only needs to collect M injected power elements instead 
of Ye values for reliable construction of the complete data 
field. 

We now describe the data construction for Z by using the 
observation matrix Y. To achieve this, the control center can 
determine matrix A, which corresponds to wavelet transform 
of the original data Z as described in o, by solving the 
following optimization problem 

rmn||A ||2 s.t. ||vec (Y) — Yjj^ < e (3) 

where Y = (^5 (g) ^t) {^s C) ^t) vec (A), (g) is the Kro- 
necker product, vec (X) denotes the vectorization of the 
matrix X formed by stacking the rows of X into a single 
column vector. We can indeed solve problem (O by using the 
Kronecker CS algorithm lEl to obtain A* = argmin^ ||A||2. 
Then, we can obtain the estimation for the underlying data as 
vec(Z*) = (g) A*. More detailed discussions of this 

data reconstruction algorithm can be found in |[T7]| . 

Now there are two questions one must answer to complete 
the design: 1 ) how can one choose ms and mr to guarantee 
reliable data reconstruction for the underlying data field?; and 
2) how can one design the data sampling and MAC protocol 
so that the control center can have sufficient information for 
data reconstruction? We will provide the answers for these 
questions in the remaining of this paper. 
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Fig. 2. Probability of success vs Mthresh with ns = 128 and ut = 128. 

TABLE I 

Calculation of Mthresh, rns and mr 


1 (.ns, Ut) I 

1 (64,64) 1 

1 (64,128) 

1 (64,256) 1 

1 (128,128) 1 

1 (128,256) 1 

1 (256,256) 1 

-Mthresh 

1551 

1892 

2880 

3196 

4620 

9200 

(ms, mT) 

(33,47) 

(22,86) 

(16,180) 

(47,68) 

(30,154) 

(80,115) 


B. Determination of ms and mr 

We would like to choose ms and so that M = msmr 
is minimum. Determination of the optimal values of and 
mr turns out to be a non-trivial task uni- So we propose a 
practical approach to determine ms and ttit. It is intuitive that 
ms and mr should be chosen according to the compressibility 
in the space and time dimensions, respectively. In addition, 
these parameters must be chosen so that the reliability of 
the data reconstruction meets the predetermined requirement, 
which is quantified by the mean square error (MSB). 

To quantify compressibility in the space and time, we 
consider two other design options, viz. temporal CS and spatial 
CS alone. For the former, the control center reconstructs data 
for any particular node by using the observations of only that 
node (i.e., we ignore spatial correlation). For the later, the 
control center reconstructs the data in each RI for all nodes 
without exploiting the correlation over different RIs. For fair 
quantification, we determine the MSB for one data field (i.e., 
for data matrix Z with ns xtit elements) for these two design 
options where MSB = ||Z - Z*\\l / \\Z\\l 

For each spatial CS and temporal CS cases, we generate 
1000 realizations of the injected powers based on which we 
perform the data reconstruction using the ID CS for different 
values of ms and mr, respectively. Then, we obtain the 
empirical probability of success for the data reconstruction 
versus Ms = msnr and Mt = nsmr, respectively where 
the “success” means that the MSB is less than the target 
MSB. From the obtained empirical probability of success, 
we can find the required values of Ms and Mt, which are 
denoted as Ms ^thresh and Mt, thresh, respectively, to achieve 
the target success probability. Having obtained Ms ^thresh and 
Mt, thresh capturing the compressibility in the space and time 
as described above, we choose ms and mr for the 2D CS 
so that msIrriT = ns/nr x Ms^thresh/MT.thresh- Similarly, 
we obtain the empirical probability of success for the 2D 
CS based on which we can determine the minimum value 
of Mthresh = msmT to achieve the target success probability. 

To obtain numerical results in this paper, we choose target 
MSE = 0.05 and target success probability equal 0.95. Fig. [2] 
shows the empirical probability of success versus Mthresh = 
msmT for = ut = 128 where Mthresh in the horizontal 
axis represents the M^^thresh, MT,thresh, and Mthresh for ID 
and 2D CS for simplicity. The data model for the injected 
power will be described in Section IV-AI For this particular 



Fig. 3. Timing diagram of the MAC superframe structure for one RI. 


setting, we can obtain the ratio msjmT = 0.691 from which 
can obtain the values of = 47 and tttt = 68 . 

Similarly, we determine the Mthresh, and ttit for differ¬ 
ent scenarios with the corresponding {us^ut) in Table J] For 
all cases, we can observe that ms < ns and ttit < ffr, which 
demonstrates the benefits of performing data compression 
using the 2D CS. Having determined ms and ttit as described 
above, the remaining tasks are to design the DDR mechanism 
and MAC protocol that are presented in the following. 

IV. DDR And MAC Protocol Design 

A. Distributed Data Reporting Design 

In any RI, to perform reconstruction for the data field 
corresponding to the latest ttt RIs, the control center must 
have data in tttt RIs, each of which comprises ms injected 
powers from ms nodes. With the previously-mentioned rolling 
implementation, the control center can broadcast a message to 
all nodes to request one more data block (the last column of 
data matrix Z) if it has only ttit — 1 data blocks to perform 
reconstruction in the current RI. At the beginning, the control 
center can simply send ttit broadcasts for ttit randomly 
chosen RIs out of ttt RIs and it performs data construction at 
RI ttt upon receiving the requested data blocks. 

B. MAC Protocol Design 

If there is a broadcast message from the control center, we 
assume that each node participates in the contention process 
with probability Ps using the slotted CSMA/CA MAC protocol 
in any RI. The slotted CSMA/CA MAC protocol ifTOl with the 
proposed frame structure is employed for data transmissions as 
described in the following. We set the optionally contention- 
free period to zero since we focus on distributed access 
design in this work. Moreover, we assume there are K^- 
superframes (SFs) in the contention period of any RI where 
SFi = SFq X 2^^^ is the length of SF i, SFq is the base length 
of SF, BOi G [0,50rnax] is the beacon order. Therefore, the 
reporting time (RT) in the underlying RI is will 

optimize the parameters Kr and {BOi} to minimize the RT 
while guaranteeing the desirable data reconstruction quality 
later. The superframe structure in one RI is illustrated in Fig.[3l 

In each SF, the nodes that choose to access the channel (with 
probability Ps) perform the contention using the standardized 
slotted CSMA/CA protocol as follows. A contending node 
randomly chooses a number in [ 0 , Wq] as the backoff counter 
(Wo = and starts counting down. If the counter 

reaches zero, the node will perform the clear channel assess¬ 
ment (CCA) for priority times (we set priority = 2). It will 
transmit data and wait for ACK if all CCAs are successful. The 
reception of ACK is interpreted as a successful transmission. 
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Otherwise this is a collision. In the case of failure in any CCA, 
the node attempts to perform backoff again with doubling 
backoff window. In addition, each node is allowed to access 
channel up to + l times. Since the length of the SF is 
limited, a node may not have enough time to transmit its data 
and ACK packets at the end of the last SF. In this case, we 
assume that the node will wait for the next SF to access the 
channel. We refer to this as the deference state in the following. 

C. MAC Parameter Configuration for Delay Minimization 

We consider optimizing the MAC parameters 
Kr^Ps^iBOi] to minimize the reporting time in each 
RI by solving the following problem: 

min {BOi}) = SF^ x 2 ^°. 

T tPs ? j 

S.t. Pr {-fiisucc ^ ^ P >uff) 

0<Kt< KT,max, 0 < BOi < BOmax, 0 < Ps < 1- 

In®, the first constraint means that the control center must 
receive ms packets (i.e., ms injected power values from ms 
nodes) with probability Psuff ~ 1. Note that one would not 
be able to deterministically guarantee the reception of ms- 
packets due to the random access nature of the DDR and MAC 
protocol. The probability in this constraint can be written as 


transmitting node will not perform contention in the following 
frames. Moreover, Pr{Pi = Ffsucc,i \Ks,i^ hi } is the probabil¬ 
ity that Ffsucc,i nodes transmit successfully in SF i given that 
there are hi contending nodes and Ks^i generic frames. 

In order to calculate Pi {Ki = Ks^i\hi} and 
Pi {^Ki = Ffsucc,i \Ks,i^ }, we have to analyze the Markov 
chain capturing detailed operations of the MAC protocol. For 
simplicity, we set NBi = NB = 5, which is the default value. 
The analysis of the Markov chain model is omitted due to 
the space constraint. Then we determine Pi {Ki = Ks^i \ hi} 
and Pi {^Ki = iTsucc,i \Ks,i^ hi } as follows. 

1) Calculation of Pi {Ki = Ks^i \ hi}: In SF i there are 
hi contending nodes and Ks^i generic frames where frame j 
has length Tij . We can approximate the distribution of generic 
frame length Tij as the normal distribution. So the probability 
of having Ks^i generic frames is written as 

Ks,i 

I>i{Ki=Ks,i \hi}=Pv{^Tij=SFi} = Q{ ) (8) 

j=l 

where Ti and af are the average and variance of the generic 
frame length, respectively whose calculations are presented in 
Appendix O 

2) Calculation of Pi {Ki = Ksucc,i \Ks,i^hi{: The second 
quantity, V = Pi {Ki = Psucc,i \Ks,i, hi } is equal to 


ns h I —I 

Pr{Psucc > rns}= = XTI 

h=ms I^succ='ms ^=1 *=1 

Ks,i ,max 

X = Ks,i\hi }Pr{i?j = iiTsucc.i \Ks,i, hi } ( 6 ) 

^ S ,i—^succ,i 

where Ffsucc denotes the number of successfully transmitted 
packets in the RI and Pi{ms = h} is the probability of h 
nodes joining the contention. Since each node decides to join 
contention with probability Ps, Pi{ms = h} is expressed as 

Pr {ms = /i} = ^ ^ Psy~’" ■ (7) 

In Q and ©, we consider all possible scenarios so that 
the total number of successfully transmitted packets over K^ 
SFs is equal to ATsucc where ATsucc ^ [ms,h]. Here, Arsucc,i 
denotes the number of successfully transmitted packets in SF 
i so that we have ^succ,i = ^succ- In particular, we 

generate all possible combinations of {Arsucc,i} for SFs 
and E represents the set of all possible combinations (|S| is 
the number of possible combinations). 

For each combination, we calculate the probability that 
the control center receives ATsucc successful packets. Note 
that a generic frame may experience one of the follow¬ 
ing events: success, collision, CCA failure and deference. 
Also, there are at most Ks,i,max frames in any SF i where 
^ 5 , 2 , max = [SFi/mm{NB + 1 , Lg + 2}J since the smallest 
length of a CCA failure frame is NB + 1 slots while the 
minimum length of a successful frame is Lg + 2 where Ls 
is the required time for one successful transmission and 2 
represents the two CCA slots. We only consider the case that 

^succ,i < Ks,i < Ks ,i,max: Mi G [l,Kr]. 

In (| 6 ]), Pi{Ki = Ks^i \ hi} is the probability that there are 
Ks^i generic frames in SF i given that hi nodes join contention 
where hi = h and hi = — iTsucc,i-i since successfully 


^S,i ^succ,i 1 

E E 


jf =0 k =0 


^ Ks,i ^ 
Ksucc,i") j-) hfi 


T>Kncc,i jyj Tykryl 

' succ,/ii ' coll,/ii ' d ' ccas,, 


. (9) 


where K^^icc,i-\-j-\-k-\-l = Ksf j, k, and I represent the number 
of frames with collision, deference, and CCA failure, respec¬ 
tively. Moreover, V^uccM^ and Vd denote the 

probabilities of success, collision, CCA failure, and deference, 
respectively, whose calculations are given in Appendix El 
In (|9]), we generate all possible combinations each of which 
has different numbers of success, collision, CCA failure, 
and deference frames. Also, the product behind the double 
summation is the probability of one specific combination. 


D. MAC Parameter Configuration Algorithm 


Algorithm 1 Optimization of MAC Parameters 

I: for each value of K^ G [1, Arr,max] do 

2 : for each possible set {BOi} do 

3: Find optimal ps as ps = argminD (iG^, {BOi} ^Ps)- 


0 <Ps<l 

4: end for 

5: The best (^{BOi} ,ps) for each K^ is (^{BOi} ,ps) = 

argmin V (AT^, {BOi} ,Ps)- 


{BOi},Ps 

6: end for 

7: The final solution (^K^, {BOi} ,ps) is determined as 
{Kr,{BOi} ,ps) = argmin V (Kr, {BOi} ,ps). 

Kr,{B~Oi},Ps 


Since there are only finite number of possible choices for 
Kr G [l,Ffr,max] and the set {BOi}, we can search for the 
optimal value of ps for given Kr and {BOi} as in step 3. 
Then, we search over all possible choices of Kr and the set 
{BOi} to determine the optimal configuration of the MAC 
parameters (in steps 5 and 7). 
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V. Performance Evaluation and Discussion 

A. Data Modeling and Simulation Setting 

Since real data for load and renewable are not available, 
we synthetically generate them by using the existing methods 
ca, na. We set the RI equal 5 minutes and employ the auto¬ 
regressive models ifTSl to synthesize the data. Specifically, we 
utilize the autoregressive process as a time series model which 
consists of the deterministic and stochastic components ifTSll 
as Xt = Xf + X^ where t is the time index, Xf and X^ 
denote the deterministic and stochastic components of Xf, 
respectively. Here, Xf Xf, and X^ are commonly used to 
represent active load power Pi, and reactive load power Qi. 
The deterministic component which depicts the trend of data 
is represented by the trigonometric functions lITSll as follows: 

f • ff 27Tkit\\ 

^. = » + E j j 

where rrih is the number of harmonics (i.e., the number 
of trigonometric functions) where Xre,i and Xim,i are the 
coefficients of the harmonics, xo is the constant, ki < 288/2 
for Vi G [1, rrih]. The stochastic component is modeled by the 
first order autoregressive process AR(1) and expressed as jlSl 
X^_^i = (ptX^ + Ut where (ft is the AR(1) coefficient and 
Ut is the white noise process with zero mean and variance of 
(1 - 

The data set of the active and reactive load powers from 
the years 2006 to 2010 is used to estimate the parameters 
(Xo, Xre,i, Xim,i, ruh, (ft) CSl. We first use the ordinary least 
squares (OLS) to estimate xo, Xre,i, and Xim,i, i ^ 
where Bayesian information criterion (BIC) ifTSll is used to 
determine ruh significant harmonics. Then we also use the 
OLS algorithm to estimate mi- Note that these parameters 
are revised and stored in every 5-minute interval over one day. 
The same time series model described above is used to obtain 
distributed generation power Sg. Then the spatial correlation 
of distributed generation powers is captured as in lIT^ where 
the correlation coefficient between 2 distributed generators i 
and j is pij = exp (—Here d = 20km and dij is 
the distance between generators i and j, which is randomly 
chosen in (0, 1km]. 

In the following simulation, we assume that wind/solar 
generators are installed at half of considered nodes for all 
following experiments. We ignore the node index i in these 
notations for brevity. The RI is set as tt = 5 minutes. The 
target probability in the constraint ^ is chosen as Psuff = 0.9. 
The MAC parameters are chosen as Ls = Tp^IackPLack^ 
Tp = b P Lmac slots {Lmac = 2 is the MAC header). 
Lack = 2 slots, tACK = 1 slot, tACK^u = 4 slots where Tp 
is the length of packet, Iack is the idle time before the ACK, 
Lack is the length of ACK, Iackm is the timeout of the 
ACK. For all the results presented in this section, we choose 
riT = 256. 

B. Numerical Results and Discussion 

1) Sufficient Probability: In Fig. |4(a)[ we show the vari¬ 
ations of PrjiXsucc versus Ps for different values of 

BOi = BO (i.e., all SFs employ the same BO) where 
ms = 16, Kr = 3, and ns = 64. It can be observed that 
there exists an optimal Ps that maximizes PrjiXsucc for 

any value of BO. This optimal value is in the range [0.3,0.5]. 



Fig. 4. PrfiVsucc vs. ps for (a) (ns, ms, Kr) = (64, 16, 3), various 

values of BO, (b) (ns, ms, BO) = (64,16, 3), various values of Kr. 



ns Verr 


(a) (b) 

Fig. 5. Reporting delay V vs. (a) ns, (b) Verr. 


Furthermore, the BO must be sufficiently large (BO > 4) to 
meet the required target value Psuff=0.9. In addition, larger 
values of PO lead to longer SF length, which implies that 
more data packets can be transmitted. In Fig. |4(b)[ we show 
the probability Pr {Psucc > kis} versus ps for different values 
of Kr where we set ns = 64 and BO = 3. This figure 
confirms that the maximum Pr {Psucc ^ kis} becomes larger 
with increasing Kr where we can meet the target probability 
Psuff=0.9 as Kr > 6. 

2) Reporting Delay: We now show the optimal reporting 
delay V in one RI versus the number of nodes ns for 
different schemes, namely TDMA, CSMA, TDMA-CS, and 
our proposed CSMA-CS schemes in Fig. |5(a)[ Here, X-CS 
refers to scheme X that integrates the CS capability and X 
refers to scheme X without CS. Moreover, TDMA is the cen¬ 
tralized non-contention time-division-multiple-access MAC, 
which always achieves better performance CSMA MAC. For 
both CSMA and CSMA-CS schemes, their MAC parameters 
are optimized by using Alg. [B It can be seen that our 
proposed CSMA-CS protocol achieves much smaller delay 
than the CSMA scheme, which confirms the great benefits 
of employing the CS. In addition, TDMA-CS outperforms 
our CSMA-CS protocol since TDMA is a centralized MAC 
while CSMA is a randomized distributed MAC. Finally, this 
figure shows that our CSMA-CS protocol achieves better delay 
performance than the TDMA scheme. 

We illustrate the variations of the optimal reporting de¬ 
lay with Verr fof different schemes where Verr = 1 — 
PrjiXsucc and ns = 0A in Fig. |5(b)[ This figure shows 

that as Verr increases, the reporting delay decreases. This 
indeed presents the tradeoff between the reporting delay and 
data reconstruction error Verr- Note that the delay of TDMA- 
CS protocol is the lower bounds for all other schemes. Inter¬ 
estingly, as Verr mcreases the delay gap between the proposed 
CSMA-CS and the TDMA-CS schemes become smaller. 

3) Bandwidth Usage: To ease the exposition, we do not 
show the performance of bandwidth usage for the CSMA and 
TDMA-CS schemes. We consider a particular neighborhood 
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N Target delay (slots) 


(a) (b) 

Fig. 6. Number of channels vs. (a) N, (b) target delay. 

with N > ns nodes, whose simultaneous transmissions can 
collide with one another. Suppose these N nodes want to 
report data to a control center. In this case, we would need 
N/ns orthogonal channel^ to support these communications. 
Suppose that the considered smartgrid application has a max¬ 
imum target delay of Pm ax determined by using the results in 
Fig. |5(a)| We should design the group size with ns nodes as 
large as possible while respecting this target delay. 

Let the maximum numbers of nodes for one group under 
TDMA and CSMA-CS schemes while still respecting the 
target delay be n^^^^ and respectively. Note that 

the TDMA scheme uses all nr RIs for data transmission while 
the CSMA-CS scheme only chooses mr RIs for each nr 
RIs to transmit the data. Thus, the CSMA-CS scheme allows 
nrlruT groups to share one channel for each interval of nr 
RIs. As a result, the number of channels needed for N nodes is 
for the TDMA scheme and N /x niTlriT 
for the CSMA-CS scheme. Specifically, we have n^^^^ = 
{40, 75} and {55,128} for TDMA and CSMA- 

CS schemes for the target delay values of Pmax = {400, 750} 
slots, respectively. Then we calculate the required bandwidth 
(i.e., number of channels) for TDMA and CSMA-CS schemes 
with a given number of nodes N. 

In Fig. |6(a)[ we show the required number of channels ver¬ 
sus N. It can be observed that our proposed CSMA-CS scheme 
requires less than half of the bandwidth demanded by the 
TDMA scheme. Also when the network requires smaller target 
delay, we need more channels for both schemes as expected. 
Finally, Fig. |6(b)| describes the variations in the number of 
channels versus the target delay for N = {2048,8192}. Again, 
our proposed CSMA-CS scheme provides excellent bandwidth 
saving compared to the TDMA scheme. 

VI. Conclusion 

We have proposed the joint design of data compression 
using the CS techniques and CSMA MAC protocol for smart- 
grids with renewable energy. Then, we have presented the 
design and optimization of the MAC protocol to minimize 
the reporting delay. Numerical results have confirmed the sig¬ 
nificant performance gains of the proposed design compared 
to other non-compressed solutions. 

Appendix A 

Calculation of 7 } and af 

In this appendix, we determine 7 } and af for a particular 
SF i. For simplicity, we again omit the index i and hi in 
all related parameters when this does not create confusion. 
First, we can express the probability generating function (PGF) 

^We ignore the fact that this number must be integer for simplicity 


of the generic frame, T{z) which includes success, collision, 
CCA failure and deference, as 

T [z) = VsucqTs {z) + VcowTc (z) + VccslsTf (z) + VdTo (z) (11) 

Here we denote Thecas, 'PcoW and Vsucc as the probabilities 
of CCA failure, collision and success, respectively. These 
probabilities can be calculated as Vecas = (1 — 

^Oii = Pc (1 - Pd) (1 - A^^+1), and Psucc = 1 - Peon - 
Pceas — Pd, where Pc = 1 — {1 — 4>)^~^. Moreover, we also 
denote Ts (z), Tc (z), Tp (z) and Tp {z) as the PGFs of 
durations of success, collision, CCA failure and deference, 
respectively. These quantities can be calculated as in lIT^ . 

Finally, we can determine T and from the first and 
second derivation of T ( 2 ;) at z = 1, i.e.. 

These parameters T and cr^ will be utilized in (O. 
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